CDS

Accession Number TCMCG021C03223
gbkey CDS
Protein Id XP_010908888.1
Location 42593..43969
Gene LOC105035146
GeneID 105035146
Organism Elaeis guineensis

Protein

Length 458aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA268357
db_source XM_010910586.3
Definition putative L-cysteine desulfhydrase 1 [Elaeis guineensis]

EGGNOG-MAPPER Annotation

COG_category E
Description Isopenicillin N
KEGG_TC -
KEGG_Module -
KEGG_Reaction R00782        [VIEW IN KEGG]
KEGG_rclass RC00382        [VIEW IN KEGG]
BRITE ko00000        [VIEW IN KEGG]
ko00001        [VIEW IN KEGG]
ko01000        [VIEW IN KEGG]
KEGG_ko ko:K22207        [VIEW IN KEGG]
EC 4.4.1.28        [VIEW IN KEGG]        [VIEW IN INGREDIENT]
KEGG_Pathway ko00270        [VIEW IN KEGG]
map00270        [VIEW IN KEGG]
GOs -

Sequence

CDS:  
ATGGATCCCCACAGAGACGGCCACGCCGAGAACGGCCGCCACGATGGCGACGGTGGCGGCGACGACACCAACGGGCCCCTTCCGAAGCGGCCGCGGCCTTCTCCACCCATCTCCACCGCTGAGATCCGCGACGAGTTCTCCCACCACGACCCCGCCGTAGCCCGCGTCAACAACGGCAGTTTCGGCAGCTGCCCGGCCTCCGTCCTCGCCGCCCAGCTCCGGTGGCAGCGCCTCTTCCTCCGCCAGCCCGACGATTTCTACTTCAACCGTCTCCAGCACTCCATCCTCCGCTCTCGTGCGCTCATCAAAGACCTCATCAACGCGGACGACATCGACGAGGTCTCCCTCGTCGACAACGCCACGACCGCCGCCGCCATCGTCCTCCAGCACGTCTCCTGGTCCTTCACCGAGGGCCACTTCAACAAGGGCGACGCCGTCGTCATGCTCCACTACGCCTACGGTGCCGTCAAGAAGTCCATCCAGGCCTACGTCACCCGCGCCGGCGGCCATGTCATCGAGGTCCCCCTACCGTTCCCGGTGACCTCCAACGAGGAGATCGTTCAAGAATTCCGCAAGGCGTTGGAGCTCGGGAAATCCAATGACCGGAAGGTCCGGCTGGCCGTGATCGACCACATTACTTCGATGCCGAGCGTCGTGATCCCTGTCAAAGAATTGACCAAGATTTGCCGCGAGGAGGGTGTAGATCAGGTGTTTGTTGATGCGGCGCATGCAATCGGGAGCATCGAGGTTGACGTGAAAGACATAGGGGCTGATTTCTACACCAGCAACCTCCACAAGTGGTTCTTCTGCCCCCCTTCGGTTGCGTTCTTATACTCCAAGAAGAGCAGGGCTTCATCCAATTTGCACCACCCAGTGGTCTCACACGAGTATGGGAATGGTCTTCCAATCGAGAGCGGGTGGGTTGGTAACCGCGATTACAGTGCCCAGCTTGTAGTGCCATCAGTGATGGATTTCATTGATAGGTTTGAAGGGGGGATTGAAGGCATTAGGAAGCAGAATCACGATAAGGTTGTGGAGATGGGGAAGATGCTGGCTGAGTCATGGCTCACTTGTCTTGGATCGCCGCCAGATATGTGCTCGAGCATGATCATGGTTGGTCTACCTGGATGTTTGGGGATTTCAAGTGAAAAGGATGCTCTCAAGTTTAGGAGTCTCTTGAGAGATCGATTCCATGTTGAGGTTCCTGTATATCATTGTTCTCCAAAGGATGGTGAGAATGGGAGCAGTTCTGTGACTGGGTATGTGAGAATTTCTCATCAGGTGTATAATGTGGAGGATGACTACATAAGGCTCAGGGATGCAATAAACAAACTTGTTCAGGACGGATTCAATTGCACAAAGCTGCCATCCAGTTAG
Protein:  
MDPHRDGHAENGRHDGDGGGDDTNGPLPKRPRPSPPISTAEIRDEFSHHDPAVARVNNGSFGSCPASVLAAQLRWQRLFLRQPDDFYFNRLQHSILRSRALIKDLINADDIDEVSLVDNATTAAAIVLQHVSWSFTEGHFNKGDAVVMLHYAYGAVKKSIQAYVTRAGGHVIEVPLPFPVTSNEEIVQEFRKALELGKSNDRKVRLAVIDHITSMPSVVIPVKELTKICREEGVDQVFVDAAHAIGSIEVDVKDIGADFYTSNLHKWFFCPPSVAFLYSKKSRASSNLHHPVVSHEYGNGLPIESGWVGNRDYSAQLVVPSVMDFIDRFEGGIEGIRKQNHDKVVEMGKMLAESWLTCLGSPPDMCSSMIMVGLPGCLGISSEKDALKFRSLLRDRFHVEVPVYHCSPKDGENGSSSVTGYVRISHQVYNVEDDYIRLRDAINKLVQDGFNCTKLPSS